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Abstract 

Brownian motion with known positive drift is sampled in stages until 
it crosses a positive boundary a. A family of multistage samplers that con- 
trol the expected overshoot over the boundary by varying the stage size 
at each stage is shown to be optimal for large a, minimizing a linear com- 
bination of overshoot and number of stages. Applications to hypothesis 
testing are discussed. 

1 INTRODUCTION AND SUMMARY 

Many problems in theoretical and applied statistics involve observing a random 
process until it crosses a predetermined boundary. We consider a version of this 
classical problem in which Brownian motion X{t), with known drift /z > and 
variance per imit time 1, is sampled in stages until X(t) > a > at the end of 
a stage. As an example consider periodic monitoring of a pollutant in a water 
supply. There is a critical level for the pollutant above which some action must 
be taken but below which one will only decide when to test again, basing that 
decision on the current level. 

If one incurs a fixed cost for each unit sampled and an additional fixed cost for 
each stage, then a natural measure of the performance of a multistage sampler 
is the sum of these costs upon first crossing the boundary. In this paper we 
describe a family of samplers and show they are first-order optimal as a ^ oo. 

Many aspects of the boundary-crossing or "first-exit" problem are well- 
studied, though without the multistage aspects considered here. The power- 
ful methods of renewal theory address successive exits and the time between 
such events (sec Feller (1971), pages 358-388). Lorden (1970) obtained sharp, 
uniform bounds for the excess over the boundary of random walks. Siegmund 
(1985) discusses further applications in sequential analysis. 
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Schmitz (1993), Cressie (1993), and Morgan (1997) have proved general ex- 
istence results for a large class of multistage sampling problems. In particular, 
the theorems of Schmitz show that a optimal sampler does exist for the prob- 
lem considered here and that the optimum has the "renewal-type" property that 
at each stage it behaves as if it were starting from scratch, given the data so 
far. But these authors do not propose specific procedures, and though there is 
an extensive literature dealing with fully-sequential (one-at-a-time) and group- 
sequential (n-at-a-time) sampling, there have been few investigations of the 
performance of procedures that vary their sample size from stage to stage. 

The families of samplers constructed below, (5^ ^ and <5+ are shown to 
be first-order asymptotically optimal in Theorem |2.6[ They have variable stage 



sizes which decrease roughly as successive iterations of the function x >->■ \Jx log a;, 
while the average number of stages required is determined by the ratio of the 
cost per stage to the cost per unit time in relation to a family of critical func- 
tions, hm- These critical functions define "critical bands" - i.e., regions of the 
first quadrant which are closely related to how close any efficient procedure can 
be to the boundary after each stage of sampling; Lemma 2.7 gives a precise 
"in-probability" lower bound on this distance. Theorem 2.9 then provides a 
converse statement to the optimality of 5^ hi^m.z^ showing that any competing 
sampler must use at least as many stages and follow the same "schedule" of 
h described in Lemma 



2.7 



2 MULTISTAGE SAMPLERS 

Define a multistage sampling rule T to be a sequence of nonnegative random 
variables (Ti, T2, . . .) such that, for fc > 1 

Tk+i-l{Ti + --- + Tk<t} e£t foraUi>0, (2.1) 

where £t is the cl ass o f all random variables determined by {X{s) : s < t}. The 



interpretation of (2.1) is that by the time T*^ = Ti + ■ ■ ■ + Tk, the end of the 
first k stages, an observer who knows the values {X{s) : s <T'^} also knows the 
value of Tfe+i , the size of the (fc + l)st stage. By a convenient abuse of notation, 
we will also let T denote the total sampling time, T*^, where M = inf{m > 1 : 
X(T"^) > a}, the total number of stages required to cross the boundary a. We 
will then describe a multistage sampler by the pair (5(a) = {T,M), where the 
argument is the initial distance to the boundary. When there is no confusion 
as to which sampler is being used, the shorthand = X{T'^), Xq = will be 
employed. 

Let c, d > denote the cost per unit time and cost per stage, respectively, 
and consider the problem of finding the multistage sampler 5 {a) = (T, M) that 
minimizes 

c- ET + d- EM. 

Dividing through by c, this is seen to be equivalent to minimizing 

ET + h- EM, (2.2) 
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where h — d/c. By Wald's equation, 



ET = EX{T)ln = a/ 11 + E{X{T) -a)/n> a/fi, (2.3) 
so the sampler that minimizes 

E{T - a/n) + h- EM (2.4) 



also minimizes (2.2), and using (2.4) instead of (2.2) will also lead to a more 
refined "first order" asymptotic theory. 



To describe a sampler that asymptotically minimizes (2.4) to first-order, it 
suffices to consider sequences {(a, ft.)} such that a — oo. We are interested 
in problems where optimal procedures use a bounded number of stages and it 
turns out that this requires 

/i > a" 

for some e > 0. It will turn out that good procedures use m stages (almost 
always) if, as a — > cx), 

a(i/2)'"(loga)i/2-(i/2)'" « a(i/2)"-^(iog„)i/2-(i/2)'"-i^ (2.5) 

where means asymptotically of smaller order. We therefore define the 

critical functions 

/.„(x)^x(i/2)'"(logx)i/2-(i/2)" 

for m = 1, 2, . . . and a; > 1, with ho{x) = x. An essentially complete description 
of how to achieve asymptotic optimality is thus given by showing how to proceed 
in two cases. The case defined by (2.5) is called {(a, h)} being in the mth critical 
band. The other case is 

h ^ Qhm{a) 

for some Q S (0, cx)), which we refer to as {(a, /i)} being on the boundary between 
critical bands m and m + 1. 

It will prove convenient in the sequel to treat ft as a function of a. To 
translate the above formulation into these terms, let B'^ be the class of positive 
functions ft such that {(a, ft(a))} is in the mth critical band (for every sequence 
of a's approaching oo) and let i3+ be the class of positive functions ft such that 
{(a, ft(a))} is on the boundary between critical bands m and to + 1 (for every 
sequence of a's approaching oo). That is, 

= {ft : (0, oo) -> (0,oo)| hjn{x) ^ h{x) ^ hm-i{x) as x oo} , 

B+ = (ft : (0, oo) ^ (0, oo)| lim e (0, ex.) 

and let B,n ~ B°^ U S+. Our notation reflects that, as a — >• oo, the average 
number of stages used by an efficient sampler approaches 

m if he S°„ 
TO + 77 if ft e S+, 
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Figure 1. The critical bands and the average number of stages used by an 
efficient procedure. 



where 77 € (0, 1) is a function of hni2._j.00 h[x) / h^^x)] Figure 1 summarizes this 
relationship. Finally, we define the risk of a sampler 5{a) ~ (T, M) to be 

R{5{a)) = E{T - a/^i) + h{a) ■ EM (2.6) 

for a given h E Bm, some m > 1. Define the optimal sampler S*{a) — {T* , M*) 



to be one that achieves R*(a) = inf^ R{S{a)). Note that, by (2.3), the definition 



of risk (2.6 1 is equivalent to the expectation of a linear combination of the 
so-called "overshoot," X{T) — a, and the number of stages used. 

A convenient way of parametrizing stage sizes is by the upper standard 
normal quantile of the probability of stopping at the end of the stage. Thus, for 
X > and z e M let t{x, z) be the unique solution of 



A simple computation gives 



t{x, z) — xj — 



x-iAx^ = (2.7) 
^Jt{x,z) 

\J A-X^i + z^ — z^ 



Zy 



Letting Zp denote the upper p-quantile of standard normal curve, the probability 
of being across a boundary x units away at the end of a stage of size <(a;, Zp) is 
p. An important asymptotic property of t{x^ z) is that 

t(x, z) ~ x/ + 0(z\fx) 



4 



as X, \z\ — > oo if \z\ <^ ^fx. 

Letting $ and denote the standard normal distribution function and den- 
sity, define 

/•oo 

A(z) = / ^{-x)dx ^ (f){z) - ^{-z)z. 

J z 

The function A will appear often in calculations of expected overshoot or un- 
dershoot. For example, letting E{Y;A) denote E{Y ■ 1a), il Y ^ N{\,a^) 
then 

E{Y; Y >y) ~ / P{Y > w)dw + y ■ P{Y > y) integration by parts 

$ ( j + y • $ ( - 

a I ^{—v)dv + y ■ ^ ( ^~ ) change of variables 

-AMVy<i>f-^V (2.8) 



In particular, by taking y = 0, A = z) — x — —z^t{x, z), and a = \Jt{x~z) 
in (2.8) we have 

E\X{t{x, z)) - X] X{t{x, z))>x]^ ^t{x, z) ■ A(z). (2.9) 
Useful asymptotic properties of A are 

A(^)-jl:'\,. ^^^^r (2.10) 

I 0(z)/z , as z — ^ -l-oo. 
The former is trivial while the latter follows from the classical expansion 

$(-z)-^[l-z-2 + 0(^-4)]. (2.11) 



2.1 Geometric Sampling 

For z e M, let 5z{a) ~ {T,AI) be the sampler such that the probability of 
stopping at the end of each stage is constant at p = z) across the stages, 
i.e., 

Tk = t{a - Xk-i,z) ■ l{Xk-i < a}. 

Then M is a geometric random variable with mean 1/p and we thus refer to 
6z{a) as geometric sampling. Although p is constant across the stages, we do 
allow it to vary with a, the initial distance to the boundary. 

Not only is geometric sampling an interesting random process on its own, 
but it also has been conjectured that optimal multistage procedures share its 
stationarity property. While Theorem |2.9| will show this is not true, geometric 
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sampling will prove to be a useful tool for designing the final stages of our 
optimal samplers in the next section. 

Lemma |2.1| establishes a fundamental upper bound on the behavior of geo- 
metric sampling and Lemma |2.2| gives an asymptotic bound on the overshoot 
of X under geometric sampling when the probability of stopping at each stage 
approaches 1. With the exception of the the main results, Theorems |2.6| and 
|2.9[ the proofs of all theorems and lemmas can be found in the Appendix. 



Lemma 2.1. Let z E R, q = ^{z), and 

g{x) = ■ {x - iit{x, z)) = ^^^{^Axii + z^ - z). (2.12) 

qz Zjiq 

Then Sz (a) — (T, M) satisfies 

{ •'?(a) + M"^Ei>9 5*''Ha)9^ «/ z > 

I ^rk(4) Efc>i .9^^Ha)9 \ tfz<0, 

where g'^^^ denotes the kth iterate of g. 

Lemma 2.2. Let z{x) he a non-positive function such that lim^j-^oo z{x) = — oo 
and \z{x)\ = o{\/x) as x oo. Then 5z(a){o,) — {T,M) satisfies 

EXm -a = 0{\z{a)\ y/a) 

as a oo. 

2.2 The samplers and 

In this section we define two families of samplers that will later be shown to be 
first order optimal under different conditions. Namely, the sampler will be 
optimal when h e and ^ '^iU be optimal when h e B^. 

Let f{x) = {6/^)y/x\og{x + 1) and note that is well-defined since / 
is increasing. The family of samplers 5^ ^(a) are indexed by a positive integer 
m and a positive function /i, and the argument a is the initial distance to the 
boundary. We define the family of samplers inductively on m as follows. Letting 
A denote min, 

5U{a) = <5c(a)(fl), where ((x) = -{y^/x^/^ A x^/'). 
= 1st stage t{a, y^\og{a/h{aY + 1)), 

followed (if necessary) by (5°^ ho/-i ('^ ~ -'^i)- 

The family of samplers (5+ ^ (a) are indexed by a positive integer m and a 
number z e M. They are defined inductively on m as follows: 

^1 zi'^) ^ 1st stage i(a, z), followed (if necessary) by 

K(a-Xi){a - Xi), where v{x) = -^log(j: + 1). 
5in+i,z{a) = 1st stage t{a, ^{l - 2-") log(a + 1)), 
followed (if necessary) by (5+ ^(a — Xi). 
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The main theorems in this section estabhsh the operating characteristics of 
h ^^'i ^rn,z- Theorem 2.3 will show that 5'^ ^ uses m stages almost always 
and has overshoot bounded by /i, asymptotically. Theorem |2.5| will show that 
5^ ^ uses TO or m + 1 stages almost always, and that the probability of using to 
stages has upper quantile approaching z. 

Theorem 2.3. If h e then (5°„,;,(a) = (T,M) satisfies 

ET-a/^ = o{h{a)) (2.14) 
EM TO (2.15) 



as a —> oo. 



Before establishing the operating characteristics of (5+ ^ in Theorem 



2.2 



we 



introduce a family of positive constants that appear in the first order overshoot 



of 5+ 2 . For TO > 1 define 



m— 1 



= - n [(1/2)"-'-' - {Il2r-T'^y^ ■ (2.16) 

1=1 

The next lemma contains the key property of the Km that we will need. 
Lemma 2.4. For m>l, as x oo 



Km/lm(V(l-2 "')//i • Xlogx) ~ Km+lh„i+i{x) . 



We adopt the notation F < G to denote F < {1 + o(l)) • G. Theorem [2^ 
establishes the operating characteristics of (5+ ^ . 

Theorem 2.5. Lei m G N and z e M. T/ien (5+^ (a) = (T, M) satisfies 

ET~a/n < A{z)Kmh„r{a) (2.17) 
£;M ^ TO + $(z) (2.18) 

as a ^ CO. 

2.3 Optimality of^;;, and5+. 

In this section we state our main optimality results of the paper, Theorems |2.6| 
shows that 5^ ^ is first-order optimal when h g and 



and 



2.9 



Theorem 



2.6 



that ^* is first-order optimal when h € K^, where the value z* is deter- 



mined by h{x)/hm{x). Theorem 2.9 provides a converse to Theorem 

2.6 showing that any efficient sampler must behave like ^ and 6^ z'- 



Theorem 2.6. If h e B^, then 

R{S"M)-rn-h{a)^R*{a) (2.19) 
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oo . If h G B^-^ , then 



m + $(z*) + A(z*) 



{z*) 



1 - $(z*) 

as a ^ oo, where z* is the unique solution of 



h{a) - R*{a) 



= lim 



^rn 



l-<i)(z*) 2:^00 h{x) 



(2.20) 



(2.21) 



(k) 

Before proving Theorem 2.6 we introduce a family of iterated functions F^^ 
which are the order of magnitude of the best-possible undershoot after k stages, 

a—Xk- Lemma|2.7|makes this explicit and Lemma|2.8|establishes a link between 

(k) 

the Ff^ and the critical functions hk- 

For X > y"^ > define Fy{x) = ^yxlog{x/y'^). For a positive function h and 
X such that h'^{x) < x define 

Ff)(a;)^FW(x)|,.,(.). 
Note that h{-) is not iterated, e.g., 



[x\og{x/h^{x))] 



1/4 



log 



\og{x/h'^{x)) 



^ [xlog (^x/h\^x\og{x/h^x))) 



h?{x) 

1/4 



log 



^Jx\og{xlh'^{x)) 



\ "yh^i^xiogix/h^ix)))^ 



Lemma 



2.7 



establishes an "in probability" lower bound of the order i^^'^^ on 
le boundary any efhcient sampler can be after each of the first 



how close to t 
m — 1 stages when h G Bm- 

Lemma 2.7. If h (z Bm and S is any sampler such that R{S{a)) — 0{h{a)) 
then for any e > and < fc < to — 1 

Pfa-Xfc>(l-e).(l/M)^-^"Vf (a) 



1 



(2.22) 



as a CO. 



Lemma 



2.8 



shows that, when h G Bm, square roots of the iterates F^ ^\a) 
are roughly constant multiples of the critical functions hk. The constants 
themselves are given by the solutions of the following recurrence relation. For 
1 < k < m define C™ to be the unique solution of 

Cl!Xi = yfCf ■ [(l/2)'=-i - (1/2)™^!]!/"; = 1. 



(2.23) 

After taking logarithms, solving ( 2.23[ ) amounts to solving a difference equation. 
This computation gives 



k-l 



Cr = n [(1/2)'^^-^ - (1/2)"-^] 



(1/2)-^ 



(2.24) 



i=l 
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where it is understood that an empty product equals 1. Note also that 

= (2.25) 

Lemma 2.8. If h e then 



Fh^^\x)^Crhkix) as x^oo, for l<k< 



If he B°„ then 



hk{x) 



< C™ as X oo, for 1 < k < m. 



(2.26) 



(2.27) 



Proof of Theorem 



2.6 



holds by Theorem |2.3[ Now 



Assume that h e B^. The left hand side of (2.191 



R*{a) < R{S°yiji{aj) the Bayes property 
= 0{h{a)) 



2.3 



by Theorem 
sampled process, 



so Lemma 



2.7 



applies to S*{a). Letting denote the S*- 



R*{a) > h{a)EM* 

> h{a)mP{M* > m) 

= h{a)mP{a~ X*^_^>0) 

> h{a)mP{a-X*^_,>{l/2){l/t,) 
^ h(a)m ■ 1 



by Lemma 2.7 proving (2.19). 

If /i G then lim2._j.00 h.{x)/hm{x) is positive and finite, hence 



hm — — = K„ 
is positive and finite as well. The function 



lim 



h{x) 



z ^ 



(^) 



1 - $(z) 



increases from to cx) as z ranges from —00 to 00, so the equation (2.21 1 has a 
unique solution, z* . Theorem |2.5| shows that 



i?((5+,2.(a)) ^ A(z*)K™/i,„(a) + /i(a)[m + $(z*)] 



A(z* 



1 - $(z*) 



/i(a) + h{a)[m + <f>(z*)] by definition of z* 



m + $(z*) + A(z*) 



1 - $(z*) 



/i(a) 
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Suppose e > 0. Again R*{a) < i?((5^ ^.(a)) = 0{h{a)), so by Lemma 2.7 

> (1 -^\a)) ^ 1. (2.28) 

Let (r*(™\M*(™)) denote the continuation of 6* after the (m - l)st stage, i.e., 

M*(™) = [Ar-(m-l)] + , 
j.*(m) ^ T* - (Ti* + • • • + T,;_i), 

and for y > define 

^(y) - -y) + /i(a)Ar(")|a - X;„_i = y]. 

We will show below that (fiy) is non-decreasing in y. Let 



7=(l-e)(l/Ai) 



We now compute a lower bound for 1^9(7). Letting 

p^P(Ar(™) = i|a-x;;_i = 7), 

we have 

pi"^S(Xjv/.(™) - (a - X*,_i)|a - X;„_i = 7) 

> ^l-'E[{XMH^.> -{a- X*^_,))1{M<"'^ = l}|a - X*^_, - 7] 



= ^i-'Hz,)^]{l-e){lh,Y-^-^'Flr-'\a) 

- A(zp)\/1 - e • (I/m)^"^ "Ci^mCa) by Lemma [5: 
= A(zp)\/1 - e • Kmh,n{a) by (|2.25[) 



A(zp)Vl - e • Y" 



-h{a), 



(2.29) 



by definition of 2;*. Also, i?(M*(™^ |a — =7) > 2-p, and combining this 

with (2.29) gives, for sufficiently large a. 



iz*) 



> 



2-p 



h{a) • (1 - e) 



/i(a) • (1 - 2e). 



(2.30) 
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Letting Y = a — ^*„_i and V = {Y > 7}, we have 



R*{a) = E[^i-^{XM- -a) + h{a)M*] 

> E[^l-\XMH^) -Y) + h(a)(m - 1 + Ar^™)); V] 

> E[ip{Y) + {m - l)h{a)]V] 

> [ip{"f) + {ni — l)h{a)\P{V) {(fi non-decreasing) 



> 



> 



A(zp) 



1 - $(z*) 



+ J7l + 1 — p 



A(zp)- 



m + 1 — p 



h{a) ■ {1 - 2e) ■ P{V) hy ^M) 
h{a) ■ (1 - 3e) 



(2.31) 



for sufficiently large a since P{V) -> 1 by (2.28). Using basic calculus, it can 
be shown that the expression in brackets in (2.31 ) achieves its unique minimum 
when p — 1 ~ $(z*), hence 



A(z* 



1 ~ $(z*) 



+ m + $(z*) 



h{a) ■ (1 - 3e) 



for sufficiently large a. Since e was arbitrary, this completes the proof of (2.201 
and hence the theorem once we verify that (p{-) is non-decreasing. 

Fix a > and let < y < y'. Let (T'^"*), M'(™)) denote the continuation 
of S* after the (m — l)st stage that uses the same probability of being over 
the boundary at the end of each stage as (T**^™), M*^'")) when starting from 
a-X*^_^^y'. Then 

S(Af'("')|a - X;„_i =y)= EiM*^"'^\a - X*^^^ = y') (2.32) 

and, letting 

p, = F(M*(") = l|a - = y') = P{M'^"^^ = l\a - X:,_, = y), 

we have 

E[{X^,„^, - y)l{M'(™) = l}|a - X*^_, = y] ^ A(zpJ^t(y, z^J 

< M^Pi)\/i{y',^Pi) since y < y' 
= - y')l{M*'"^ = 1}I« - X*^-i = y']- 

Similar arguments inductively give 

ii;[(XM-(™,-y)l{M'(") > l}\a~-X*^^, =y]< E[{XMHr.^^y')l{M*^^^^ > l}\a~X*^_ 
and these last two bounds show 

E{X^,,„„, - y\a - X;„_i - y) < EiX^j,^^, - y'\a - X*„^_, = y'). (2.33) 
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Then 

^{y) < E[^i-\X^,,^^, - y) + /i(a)M'('")|a - = y] (optimality of (T*(™), M*(- 

< E[^l-\XuH^■, - v') + /i(a)Af*(™)|a - Xl_^ = y'] (by and ( p^Sp 

finishing the proof. □ 



The final resuh of this section is a type of converse to Theorem 2.6 showing 
that good samplers must behave like ^mh^^mz i'^ ^^^t only the sense that m 
stages are necessary when h G B^m but also that any efficient sampler must 
follow the same "schedule" that 5°-^ and (5+ ^ follow for the first m — 1 stages, 
described in Lemma l277l 



Theorem 2.9. Assume that h £ Bm and let 
S(a) = 



where z* is as in (2.21). If S'{a) = {T,M) is a sampler such that there is a 

hounded below 1 (2.34) 

R{8'{a,))^R{5{a,)) 



sequence — ?■ oo with 

P [a, -Xu>{l- £)(1/m)'"'" )(a,) 
for some 1 < k < m and e > 0, then 



h{ai 



(2.35) 



as i — > cxD. In particular, (2.35) holds if P{M > m) 1. 



Proof. If (2.34 1 holds then Lemma 2.7 implies that 



RjS'jai)) 

h{ai) 



oo 



as i — cxD. We know that R{S{ai)) — 0{h{ai)) by Theorem 2.6 

R{5'{ai))^R{5(a^) R{S'{ai}) 



so 



h{ai) 

as i — >■ cx), proving the first claim. 
If 



h{ai) 



0(1) 



P[a-Xu>{l~e){l/^^f-'-^Fl^\ay^ 
for all 1 < fc < TO, then 

P{M>m) = P{a-Xm-i>0) 

> P{a- X,„_i > (1 - £)(1/m)1-^"""" V/J-^V)) 
^ 1, 



-oo 



proving the second assertion. 



□ 
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3 AN APPLICATION TO HYPOTHESIS TEST- 
ING 



In this section we discuss how the above multistage samplers can be used to 
construct efficient multistage hypothesis tests and give a numerical example 
of the performance of such a test. Let Yi,Y2, . . . be i.i.d. with density / and 
consider testing the simple hypotheses 

Ho-.f^fo vs. Hi-.f^h (3.1) 

in stages. We can describe multistage tests of these hypotheses by triples 
{N, M, D), where N is the 

total number of observations, M is the total number of stages 

used, and D is the decision variable, taking values in 

{0,1}. One measure of the performance of {N,M,D) in testing Hq,Hi is 
the integrated risk, which we define as 

1 

r{N, M, D) = Y^cE^N + dE,M + WiPi{D ^ i)]TTi, (3.2) 

where < c, d < 1 represent the cost per observation and cost per stage, 
TTj is the prior distribution on {/o, /i}, and Wi > 

represent the penalty for a wrong decision. Here Ei and Pi denote expecta- 
tion and probability imder fi. 

The multistage samplers discussed above, with some simple modifications 
(like making sure each stage size is an integer), tell us how to sample the Yj 
in stages by observing some random process until it crosses a boundary. The 
relevant random process here, taking the place of the Brownian motion above, 
is the log-likelihood process, which we now define. 

Assume that 

and let 

Define the log-likelihood process 

-.(")-.-i:.o.(£|j). 

Note that, like the Brownian motion above, EiXi{n) = njii > and VariXi(n) = 
n. Thus, we can use a given multistage sampler 6' with boundary a to sample 
the Yj by treating /j as the true, underlying density and sampling according to 
5' until Xi{n) > a at the end of a stage. If a is large, this is compelling evidence 
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Table 1. Results for testing /i = .25 vs. /i = —.25 about a Gaussian mean, 
with d = .001, TTi = 1/2, = 1. 



Procedure 


EN 


EM 


r 


r{S)/r (%) 






d/c^ 


1 




S 


62.2 


5.2 


.068 


100 


^9(1) 


57.5 


57.5 


.115 


59.1 




64.9 


4.6 


.073 


92.9 


Sgm 


76.7 


2.6 


.080 


85.0 






d/c = 


5 




S 


68.3 


2.9 


.017 


100 




57.5 


57.5 


.070 


24.0 


dg{22) 


72.7 


3.3 


.018 


92.8 




83.6 


1.9 


.019 


88.4 






d/c^ 


10 




S 


76.6 


1.9 


.0097 


100 


'5.(1) 


57.5 


57.5 


.0644 


15.1 


Sgi37) 


80.5 


2.2 


.0104 


93.3 


'5,(74) 


97.6 


1.3 


.0112 


86.6 



that fi is indeed the true density, and thus provides us with a "one decision" test 



of the hypotheses (3.1). We can construct an ordinary "two decision" test from 
two samplers. So and 6i, by somehow choosing a first stage size, computing the 
maximum likelihood estimated of the true hypothesis i G {0, 1} from the data 
observed in that first stage, and then continuing with (5j and a slightly modified 
stopping rule (e.g., "stop when > a") to protect against an error in i. 

A natural choice for the size of the first stage is the minimum of the two stage 
sizes dictated by Sq, Si. 

Let S be the test constructed in this manner from the two samplers S°^, d/ci^i)^ 
i = 0, 1, where = a^^ logc?^^ and 

m* = inf {?7i > 1 : Hm{p-i)K%{ai) <d/c< K,„+i(/ij)/i„j+i(ai)} . 

Bartroff (2004, 2005) shows that, under general conditions on the fi, S minimizes 
the integrated risk ( |3.2[ ) to second order as c, cZ — >■ at specified rates. Bartroff 
(2004) and a third paper by the author in this series will extend these results to 
composite hypotheses about the parameter of an exponential family. See these 
references for more details on the testing problem. 

Table 1 contains the results of a numerical experiment comparing S with 
group-sequential (i.e., constant stage-size) testing of the hypotheses fi = .25 
vs. fi = —.25, about the mean of Gaussian random variables with unit variance. 
Sg(k) denotes group-sequential testing with constant stage-size k, which samples 
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until 



5: log 



fo(Y,) 



> logd 



at the end of a stage, which is equivalent to the stopping rule of 5. For each 
value of d/c, the operating characteristics of Sg{k) are given for three values 
of k: k = I, the best possible k (determined by simulation), and two times 
the best possible k. The results show significant improvement in the integrated 
risk of the variable stage-size test i5 upon Sg. This improvement decreases for 
large values of d/c, but this is to be expected since the number of stages of any 
reasonable test will approach 1 in this limit. 

Here we constructed tests from the samplers ^. In practice, tests con- 
structed from the samplers (5+ ^ also perform well and behave almost identically 
as those constructed from ^ . The choice here was made merely to simplify 
presentation. 



A PROOFS OF THEOREMS AND LEMMAS 



Proof of Lemma \2.1\ First we will prove 

E{a- Xk\M > k) < g^'''^ (a) for all fc > 0. 
The fc = case is trivial and we have 



(A.l) 



E{a-Xk+i\M>k + l,Xk 



E[{a - Xk) - (Xk+i - Xk)\M > fc -M, Xk] 
A{-z)^tia- Xk,z)/q 

(a - ^ fc) - ^J■tia - Xk,z) 



qz 



g{a- Xk). 



g is increasing and concave, so by Jensen's inequality and the induction hypoth- 
esis 



E{a-Xk+i\M > fc + 1) 



= E{g{a ~ Xk)\M > k + 1) 
< giE{a - Xk\M > k + 1)) 
= g{E{a - Xk\M > k)) 



(A.2) 



proving (jATJ). In ( |A.2[ ) we use that E{a - Xk\M > fc + 1) = E{a - Xk\M > fc); 
this is true since the value of Xk and the number of additional stages required 
to cross the boundary are independent, as long as Xk < a. 

We now prove (2.13). Let p — 1 — q — $(— z). Assume first that z > so 



that p < 1/2. E{Ti\M > 1) = t{a, z) and for fc > 2, 

E{Tk\M>k) = E{t{a~ Xk-i,z)\M > k- 1) 

< fi^^E{a- Xk-i\M > k-l) since p< 1/2 
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by (A. 11. Using these two relations 

ni m m 

E{T\M = m) = ^ii;(rfc|M = m) = ^ii;(Tfc|M > fc) < t(a, ^ gC^-^H 

k = l k=l k=2 

since E{Tk\M = m) = E{Tk\M > k) for any to > fc as discussed above. Thus 
ET = E{E{T\M)) 

m 

m>2 fe=2 

= t{a, z) + ^1^^ g'^^\a)q^ by reversing order of summation 

k>l 

using the relation between g and <(•, z) in ( |2.12 ). 

Now assume z < 0. Consequently, t{-,z) is concave, so using Jensen's in- 
equality and (A.l), 

E{Tk\M>k) = E[t{a- Xk-i,z)\M > k-l] 

< t{E[a- Xk-i\M > k-l],z) < t{g^''-^^a),z) 

and, as computed above, 

ET = E{E{T\M)) < J2 q-'-'pf^tig^'-'H^):^) = ^ S^'H")?' 

m>l k=l ^ ^ fc>l 



again using (2.121 for the final step. □ 



Proof of Lemma 2.2 , Let z = z{a) and g be as in Lemma 2.1 with p — ^i~z) 
and q = I — p. A simple computation shows that g{a) has a unique positive 
fixed point 

A(-z)0(z) 
a; = X (a) = 5 

so that g{a) < (a V a;*), where V denotes max. Then 

EXj^i — a — fJ-{ET — a/^) Wald's equation 
qA{z) 



00 



^ Xr^E5*'H«)'^'"' byLemma[2T 
^ fc=i 



< 



qMz) I E^i 9{a)q = 9{a)/p < '2g{a) when a > 
A(-z) ' a;*?''"^ = a;*/p < 2x* when a < x*, 

since .g(a) < (a V x*) 
- 2-^^-(.9(«)Vx*). (A.3) 
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Now 



(A.4) 



by (2.10) and since \z\~ o{^/a). Also 



A(-z) 



gA(z) A(-z)0(z) _ A(z)0(z) 
A(-z) ■ 
|z|0(z) 



^$(z) 
= 0(z2) by (plTl 

- 0(|z|V«) 



by (2.10) and since q — $(z) 



since \z\ = o{^/a). Plugging (A.4) and (A. 5) into (A. 3) gives the claim. 



(A.5) 

□ 



Proof of Theorem |2.3[ We proceed by induction on m. For m — 1, assume 
he B1 and let Sl^{a) = {T,M). Note that 



ai/4 

so Lemma [2 . 2 1 applies . Then 



ll/4 



\/a = o(l) • ^/a = o{^/a), 



ET—a/fi = ^ ^{EXm — o) by Wald's equation 
= 0(|C(a)|Va) by Lemma ([2^) 

• \/a^ by definition of C, 
■h{a)\ 



showing that the theorem holds for m = 1. 

Now assume that h e and let 6^^^ f^{a) = (T, M). Let ^ = ^log(a//i(a)2 + 1) 

and t = i(a,^), the size of the first stage of S°^^^ Note that 


















o(l) • h{a) 


o{h{a)), 



hiay 



fern (a) 



> 



h{a) J hi{af 



h{a) 



1 



so that ^ — >■ cx), which is important in the definition of the first stage of 5°^^-^ ^. 
Using the definition of /, as a; — > oo 

hmifix)) = 0((xloga;)(i/2)"^\log(xloga;))i/2-(i/2)'") 

= 0(:r(i/2)'""\logx)i/2-(i/^)'""') = 0{Ka+,{x)) ^ o{h{x)), 
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or equivalently, h„i = oiho f A similar argument shows ho f ^ = o{h„i-i), 
which shows that 

hof-'eB°^. (A.6) 
For y > let S°^^^^^,,{y) = (r,M') and 



4'iy) 



EM' 

EXm' - y. 



Suppose e > 0. By the induction hypothesis there is a constant j/o such that 



\il}{y)~m\ < e 

^{y) < e-h{r\y)) 



(A.7) 
(A.8) 



for all y > j/o, where (A.8) uses (2.141 and Wald's equation 
EM -1 



E{M -l;0<a-Xi<yo)+ E{M - 1; a - ATi > y^) 
E{i,{a - Xi); <a-Xi<yo) + E{^{a - X^); > yjj^.g) 



Letting A = supo<j,<j,^ ipiy) < oo, 
E{^Pia-Xi);0<a-Xi<y,] 



< A- P{Q < a- Xi <yo) 

< A ■ P{Xi >a-yo) 



A-P 



Xi - 

= A -0(1) =0(1), 



Xi — lit ^ a — lit 



Vo 



A-P 



Vi Vi 
>e + o(l) 



(A.IO) 



by, say, Chebyshev's inequality since ^ — > oo. Plugging this into (A.9|, 



\EM-{m + l)\ < oil) + \E{ij{a~Xi)-m;a-Xi>yo)\+m,-P{a-Xi<yo) 

< o(l) + E{\ij{a - Xi)-m\;a- Xi>yo)+m- o(l) 

< o{l)+e-P{a-Xi>yo) + o{l) by ^ 

< £ + £ + £ = 3e 

for sufficiently large a. This shows that EM m+ 1. 
To show that (2.151 holds for ni + 1, first write 

liE(T — a//i) = EX]\i — a 

= E{Xm - a; a/ = 1) + E{Xm - a;0 < a - Xi < y^) 

+E{Xm -a;yo<a-Xi< (l/2)/(a)) + E{Xm -a;a-Xi> (l/2)/(a)) 
= I + II + III + IV. (A. 11) 
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First, 



/ = E{Xi -a;Xi> a) 



Vt 



e 



by p.lOD 

0{h{a)/^) 

e 



(since t ^ a/fi and (/)(^) cx hj ^/a) 
o{h{a)). (A.12) 



Next we have 



// = E{ip{a - Xi); <a~Xi<yo)^ o(l) 



(A.13) 



by the same argument leading to (A. 10 1. 

Before considering note that we may assume without loss of generality 
that h is non-decreasing. Otherwise, we could replace h by h(x) = infj,>a: h{y) 
throughout the proof, since h is non-decreasing and bounded above by h. Since 
/^^ is also non-decreasing, ho f^^ is thus non-decreasing. Now 



/// 



E{ipia - Xi);yo < a ~ Xi < (l/2)/(a)) 



< E{e ■h{r\a~X^));y,< a- X^< {1/2) f{a)) by ([Ol 

< e • ft-(/~^((l/2)/(a))) /i o non-decreasing 

< e ■ h{f~^{f{a))) /^^non-decreasing 

= e-h{a). (A.14) 

We know that h{f^^{x)) ~ o{hm~i{x)) and hence h{f~^{x)) = o{x) since 
m = 1 gives the largest case, asymptotically. Thus we assume that a is large 
enough so that 

h{f-Hy))<y for aU 2/ > (l/2)/(a). (A.15) 



Then 



IV = E{if{a~Xi)-a-X^>{l/2)f{a)) 

< E{e-h{r\a-Xi)):a-Xi> {1/2) f (a)) by ([XSl 

< eE{a -Xi:a-Xi> {1/2) f {a)) by ( |Al5| 



Vt-A 



(M 

\2Vi 

using (2.8) for this last step. Now 

/(«) 



M. 

2Vi 



+ (/(a)/2)..(e-01) 



(l + o(l)) since t~a//i 



(A.16) 



> 
> 



27a//i 

v/log(a//i(a)2) + o(l) + 3v/log(a + l) • (1 + o(l)) 
v/log(a//im+i(a)2) + 2v/loga since h E 

.^log(a/a(i/2)'") + 2yioga 

v/(l-2-™)loga + 2 VTog^ > Tk^- 
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Plugging this back into (A. 16) gives 
IV < £ 



Vi ■ A(v/l^) + (/(a)/2) • $(-v/l^) 

= 0{V^)-^^y^+OiV^a)-^^^M^ by ™ and Mills' ratio 
log a Vlog a ' ' 

= O(v^)-0(v/loga) = O(l)=o(/i(a)). (A.17) 

Plugging ( |Al2l ), ( |Al3l ), (|Al4|, and ( |Al7| into ( |A1T| ) gives 

^ET - a//i < o{h{a)) + o(l) + £/i(a) + o{h{a)) < 4eh{a) 

for sufficiently large a. This shows that ET — a/ji — o{h{a)), finishing the 
proof. □ 

Proof of Lemma 12. 4i 

\og{K^+^/K^) = -(1/2)™+! logM + ^(1/2)^+1 log[(l/2)^ 



(1/2)^ 



i=l 



m— 1 



^ (1/2)^+1 log[(l/2)™-i-' - (1/2)™-!] 



= -(l/2)'"+Mog/i + (l/2)™+Mog[l -2'™] - ^(l/2)*+Mog2 

1=1 

= (l/2)"+i log(l - 2-™)^ + (1/2 - (1/2)™) log(l/2). 

On the other hand, letting y — ^(1 — 2^™-)/ ji ■ xlogx, 

\og{h^{y)/h^+i{x)) = (1/2)" log 2/ +(1/2 -(1/2)™) log log 2/ 

-(l/2)'"+Moga;- (1/2- (l/2)™+i)loglogx 

= (l/2)™+i[log(l - (l/2)")//i + logs + log log x] 
+ (1/2 - (l/2)™)[log(l/2) + log log X + 0(1)] 
-(1/2)™+! logo; - (1/2 - (1/2)™+!) log log X 

= (1/2)™+! i^g(^ _ 2-™)/a* + (1/2 - (1/2)™) log(l/2) + 0(1) 

= l0g(K„i+i/Km) + 0(1) 



SO that h„i{y)/hm+i{x) k,„+i/k„ 



□ 



2.5 



We proceed by induction on m. Let 5i^^{a) — {T, M). 



Proof of Theorem 

Using Wald's equation^ 

^iE{T - a/^l) = E{Xm - a; M = 1) + E{Xm - a; M > 1). 



Let t — t{a, z), the size of the first stage of 5+^(a). Using (2.10), 

E{Xm - a; Af = 1) = E{Xi -a;Xi> a) 
= Vi ■ A(z) by ^ 
^ \J aj [1 ■ A(z) 



(A.18) 
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since t ^ a/fi. For y > let 



Suiy)iy) = (T',M'), 
ip{y) = EXm' - y, 
^{y) = EM', 



where i/(y) = —y^\og{y + 1) as in the definition of Si^. Suppose e > 0. By 
Lemma 2.2 and since EAI' = l/$(|zy(?/)|) — >■ 1 as y — J' cx) there are constants 
j/o, C such that for all y > yo, 

ifiy) < C^ylogiy + 1) (A.19) 
my)-l\ < e. (A.20) 



Let y = a - Xi . Note that 

P{Y > yo) = Hz - Vo/Vi) ^ $(z) 

by continuity, and 



P{Y < 0) = 1 - $(z) 



(A.21) 
(A.22) 



so that P{0 <Y <yo) 
Now 



0. 



E{XM-a]M>l) = E{ip{Y)-Y>Q) 

= E{^{Y)-Q<Y <yo) + E{^{Y)-Y>yo) 

= o{l)+E{C^Y\og{Y + l)-Y>yo) by|X22] 

< o(l) + C^E{Y\Y > yo) log[E{Y\Y > y„) + 1], 

where this last uses concavity of y i— > ylog{y + 1) with Jensen's inequality. Now 

EiY\Y > yo) = P{Y > yo)-^E{Y- Y > yo) 

= [<Piz) + o{l)]-^ -[Vt- A{-z + yo/Vt)+yo-'^iz~yo/Vt)] using ^ 
= 0(l)[0(^)-0(l) + yo-0(l)] =0(^^). 

Thus 

E{Xm ~a;M>l) < o(l) + C^O{V^) \ogO{V^) 
= o(Va), 



and combining this with (A. 18 1 gives 

ET - a/fi < A(z)^"3/2y^ + o(v^) = (1 + o{l))A{z)Kihi{a), 



establishing (2.17) for m = 1. 

\EM - (1 + $(z))| = |1 + E{tP{Y);Y > 0) - (1 + $(z))| 

< \EiiPiY);0 <Y<yo)\+ E{\ij{Y) -l\;Y>yo) + \P{Y > yo) - $(z)| 

< o(l)+e + o(l) by (|A^ and (|A^ 

< 3e 
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for sufficiently large a, proving that EM — > f + <&(z). 

Next let <5++i..(a) = iT,M) and = {T',M') for y > 0. Let 

ip{y) = EXm' - y, 
My) = EM', 

and suppose e > 0. By the induction hypothesis there is a constant yi such 
that, for all y >yi, 

fiy) < (l + £)/iA(z)K™/i™(y) (A.23) 
IV'(y)- (m + $(z))| < e. (A.24) 

Let t — t{a, ■\/(i — 2^™) log(a + 1)), the size of the first stage of <5^+iz(a)- 
iin letting Y ~ a ~ Xi, 



nE{T-a/fi) = E{Xm - a;M = 1) + E{Xm - a;M > I) 

= EiXi -a;Xi>a)+ -B(</j(r); < F < + F ^2^^) 

Now 

E{Xi ~ a; Xi > a) = Vt-A{^/{1 - 2-"') log(a + 1)) by ([2j| 

^ (/)(v/(l-2-'")log(a + l)) _, 
~ V«/a^ 7^ , — by(|2.i0l 



(i-2-™)log(a+l) 
0(a-V2+(i/2)'"+^) 

(1 - 2-") log(a + 1) 



o{a 



(1/2)" 



o(/i™+i(a)). 



(A.26) 



< Y < yi) = o(i) by a now routine argument and using (A.23), 

E{^{Y);Y>y,) 



(1 + e)fj,A{z)K„ 



< E{hm{Y)-Y>y^) 



< h^{E{Y-Y>yi)P{Y>yi)-') 
by Jensen's inequality since /i^ is concave. Now 

E{Y;Y> yi) ^ Vt-A(- ^(1 - 2-™) log(a + i)) ^ y^V(l - 2-™) log(a + 1) 



by a routine application of (2.8). Using that P{Y > yi) 1, for sufficiently 
large a we have 



E{^{Y)-Y>yi) < {l + eftiA{z)K„MVaU^) 
< {1 + e)^fiA{z)K^+ih,n+i{a), 



(A.27) 



using Lemma 2.4 Plugging (A.26) and (A.27) into (A. 25) gives 

/im+l (l) 

< [e + e+(l+ef]A(z) 

< [1 + 9e]A{z)Km+iKn+i{a) 
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for sufficiently large a, which shows that (2.171 holds for to + 1. As for the 
number of stages used, 

\EM - {m + l + <i>{z))\ = \l + E{^j{Y);Y > 0) - (to + 1 + $(z))| 

< \E{tl;{Y);0 <Y < yi)\ + E{\^j{Y) - (to + $(z))| ;r > y^) + {m + <i>{z))P{Y < 

< o(l) + £-P(r >?/i) + (to + $(z))-o(1) by ( |X24l ) and since P{Y < yi) ^ 

< o(l) + £ + o(l) < 3£ 

for sufficiently large a. This shows that EM m + 1 + $(z) and completes the 
proof. □ 



Proof of Lemma 



2.7 



Let Gfc = (l//.)i-2-'=pW 
Vk = {a-Xk>{l- e)Gk} 



F^^(a). Suppose e > 0. Let 



The fc = case is trivial since (2.22) is equivalent to P{a > a) ^ 1. Fix 

; P 



1 < k < m and assume that P{Vk-i) 1. Let 5{a) = (T, M) and 

a — Xk-i — /iTfc 



Note that 



h{aY = o{h„i-i{a)^) since h S Bm 

= o(F^'""^^(a)) by Lemma [2:81 

= o(G,„_2) 
- o{Gk-i) 

since fc — 1 < to — 2. Thus Gk-i/h? — >■ oo and so does \og{Gk^i/h?). With this, 
we claim 



P{Ck > v'log(Gfe_i//i2(a)) - l\Vk-i) ^ 1. 



(A.28) 



Let £, = y^\og{Gk-i/h^{a)) - 1 and [/ = {C^ < If ( |A.28| were to fail there 
would be a constant ry > and a sequence of a's approaching oo on which 
P(f/|l^fe_i) > T]. Then 

/xi?(5(a)) > nE{T-a/fi)^E{XM~a)>E[{XM-a)l{M^k};UnVk-i] 
= S[A(Cfe) Vi(a - ^fc-i, Cfe); U n l^fe_i]. (A.29) 



The function inside the expectation in (A.29 ) is decreasing in both Cj, and A'/j_i, 
hence 

fiRid{a)) > A(e) v/i((l - £)Gk-iA) ■ P{U D Vk-i). (A.30) 
By assumption, P{U\Vk-i) > rj and P{Vk-i) 1, so 

P([/nl^fc_i) >??/2, (A.31) 
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say, for large enough a. Also 

A(e)v/^((l-£)Gfe_i,0 



m 



v/(l - £)Gfc_i/Ai 



. , exp(v/log(Gfe_i//.2(a))-l/2) 
> £ 'i'(a) — : — — some £ > 



h{a)/o{l). 



(A.32) 



Plugging ( |A.31[ ) and ( |A.32[ ) into ( |A.30[ ) gives /i(a) = o(i?(( 5(a))) which con- 
tradicts our assumption that R{S{a)) — 0{h{a)). Hence, (A. 28) must hold. 
Then 



P{Vk\U' n Vk-i) = P{a - Xfc > (1 - e)Gk\U' n 14_i) 

= p 



(ATfe - - ^Tfc ^ Q - ^fc-i - (1 - £)Gfc - /iTfe 



On Vk-i, 

a - Xk-i - (1 - e)Gk - ^J.Tk 



(a'.33) 



= Ck 



2m(1 - e)Gk 



which is increasing in Cfe- Hence, on U', 
2^(1 - £)Gfe 



v/4M(l-£)Gfc-i+C,'-a 
2Ai(l - £)Gfc 



2/i(l-£)Gfc 
y/ifi{l~e)Gk-i+Cl-Ck' 

2^,(1 - £)Gfc 
v/4/i(l-£)Gfc_i+e2-e 



(1 + «(!)) 



V4m(1 - £)Gfc_i 
e - V'(l-£)log(Pf-^)(a)//i2(a))(l + 0(1)) 
(1 - VT^)V'^og{Gk-i/h^a)) = 7 ^ oo. 



Substituting this back into (A. 33 1 gives 



p{Vk\u'nVk-i)>i-{j/2y 



1 



by Chebyshev's inequality. Thus F(Vfc) > P(yfc|?7' n Vk-i )P{U ' D Vk-i) ^ 1 
since P{U' n Vfe-i) — ?> 1 by the induction hypothesis and (A. 28), finishing the 
induction and proving the lemma. 



□ 



2.8 



Let F'' denote fI^\x). Fhst we prove (^2.26^ by 



Proof of Lemma 

induction on k. For fc = 1, 

= ^/^ = 1 • = GJ" • /ii(a;) 
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Now assume 2 < fc+1 < to, CJI'hkix), and let Q = lim/i//i„ G (0, oo). 

Observe that 



log 



k-1 



h{xy 



log 
log 
log 



hk{xf 



hm{x)' 

' x(V^)'"(logx)i-(V^)'" 

x(l/2)'"-i(logx)l-(V2)"-i 



[(l/2)'=-i-(l/2r-i]log 



(A.34) 



so 



{(Cr/ife(x))2[(l/2)^-i - (l/2)"-i]logx}'/' 

yCf •:E(i/2)'"^\log:r)i/4-(i/2)'°+'[(l/2)'=-i-(l/2r-i]i/4(ioga:)i/4 
• [(l/2)'=-i - (l/2)™-Y/'/^fc+i(a;) 

(A.35) 



by (2.23). 



Next we prove (2.27) by induction on k. The k = \ case is again easy since 



for any to > 2. Now assume 2 < fc + 1 < to and that (2.27) holds for k. Then, 
since /im <C /i <C /im.-i, 



log 



V hm[xf 



[(l/2)'=-i-(l/2r-i]logx. 



by the same argument leading to (A.34 1. Then, by repeating the argument 
leading to (A.35 1 with < in place of ^, 



by (2.23). The other bound is similar: 



log 
and so 



h{xf 



^ log 



K%-\{xY 



[(l/2)'=-^-(l/2r-ilogx, 



[(1/2)'-^ - (l/2)™-2]/ife+i(.x) = 



by replacing to by to — 1 in (A.35) and (2.23). 



□ 
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